2023-12-12 14:30:58.AIbase.4.1k
Zhipu AI Releases CritiqueLLM Scoring Model to Evaluate Text Generation Model Performance
Zhipu AI has launched the high-quality, low-cost scoring model CritiqueLLM. Traditional evaluation metrics such as BLEU and ROUGE lack an understanding of overall semantics. CritiqueLLM proposes an interpretable and scalable text quality evaluation model that outperforms other models across 8 common tasks. CritiqueLLM generates scores through methods such as user query augmentation, reference text evaluation data collection, non-reference text evaluation data rewriting, and model training.